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DISCUSSION: CONDITIONAL GROWTH CHARTS 

By Matias Salibian-Barrera 1 and Ruben H. Zamar 2 
University of British Columbia 

The authors are to be congratulated for a very important contribution 
with many practical applications. Including covariates in the construction 
of growth charts will undoubtedly lead to more informative tools for practi- 
tioners in many disciplines. 

Growth charts are widely used in practice to monitor the evolution of par- 
ticular univariate measurements over time. In some situations, a patient's 
evolution may be better described by the joint behavior of these variables of 
interest. For example, instead of using two univariate growth charts to map 
the weight and height of children, the physician may prefer to simultane- 
ously locate the child's measurements with respect to the joint population 
distribution of weight and height for children of the same age cohort. It is 
well known that biological variables are generally correlated and that these 
correlations may be important to determine multivariate boundaries for the 
"normal" range of the response vector. When measurements are correlated, 
univariate growth charts may be unable to represent interesting combined 
features of the variables of interest. 

We can identify the following challenges when one tries to develop multi- 
variate growth charts: 

(a) choosing an appropriate definition of multivariate quantiles; 

(b) modeling multivariate quantiles to include the subject's prior devel- 
opment history and other covariates; and 

(c) devising visualization tools to display individual trajectories with re- 
spect to the reference populations. 

Regarding item (a) , a nice unified presentation of several definitions of mul- 
tivariate quantiles along with an insightful account of desirable properties is 
given in [2] . A proper extension of Wei and He's model to the multivariate 
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setting [which would address (b) above] is of great interest but beyond the 
scope of this note. We will focus our discussion on item (c) for the simple 
case where the only covariate is time. 

For simplicity of presentation, in what follows we will restrict our attention 
to the bivariate case and use quantiles based on Tukey's half-space depth 
[3]. For a random sample xi, . . . , x n , depth-based multivariate quantiles can 
be obtained as follows: 

(i) first, for each observation Xj compute its half-space depth iJD(xj), 
which is defined as the smallest fraction of observations included in a closed 
half-space with boundary line that passes through x»; 

(ii) define the multivariate quantile of x$ as the sample quantile of HD(xj) 
among all the n half-space depths HD(xj), j = 1, . . . , n. 

Figure 1 shows a bivariate dataset where "extreme" points (those with quan- 
tiles smaller than 0.05) are indicated with circles, and innermost points (cor- 
responding to quantiles larger than 0.95) are shown with x's. Note that this 
definition is closely related to the following population definition of depth- 
based median-oriented multivariate quantiles [2]. The half-space depth of a 
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Fig. 1. A random sample of size n = 1000 from a bivariate normal distribution. Points 
with depth-based multivariate quantiles lower than 0.05 are shown with o 's, and those with 
quantiles higher than 0.95 are displayed with x 's. 
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vector x with respect to a reference probability measure P is denned as 

HD(x, P) = m£{P(H) : H is a closed half-space and x € H}. 

Following [2] , the depth-based median-oriented multivariate pth quantile is 
the boundary of the set I(jp,P), where 

7 P = sup{ 7 >0:P(J(7,P))>p} 

and 

I( 1 ,P)={x:HD(*,P)> 1 }. 

Multivariate quantiles are, of course, difficult to visualize when the dimen- 
sion of the data is higher than 2 or 3. Since the main objective of growth 
charts is to locate the trajectory along time of a particular individual with 
respect to the corresponding different reference populations, we propose to 
use a series of univariate plots, one for each time point where the individual 
of interest has been observed. 

More specifically, assume that we observe a vector of p measurements 
x(i) € W, at k time points ti,...,tk- Denote these observations by x(tj), 
i = 1, . . . , k. Furthermore, assume that we have k reference populations from 
the distribution of the vector of measurements of interest X(t) at the same 
times £j, i = 1, . . . ,k. For each time ti, let be the corresponding mul- 
tivariate quantile of the observed x(ij) with respect to the corresponding 
reference populations. Let a € MP with ||a|| = 1 be any unit vector, and con- 
sider the projections a'x(tj), i = 1, . . . , k. Denote by ^(a) the corresponding 
(univariate) quantiles of the projections a'x(tj) with respect to the projected 
reference populations. We propose to find the vector ao for which the re- 
sulting g^(ao)'s are closest to the multivariate quantiles q^, i = 1, . . . ,k. 
Because in many applications the observations x(ti), . . . , x(ifc) correspond 
to measurements taken over time on a particular patient, we will call this 
optimal vector ao the "patient-specific direction." In other words, we define 

k 

[% - % (a)] , 

l|a||=l j= i 

and use this direction to find univariate reference populations for which the 
corresponding projection of the measurements of the "patient of interest" 
are closest to the multivariate ones. Note that this patient-specific optimal 
direction (common for all time points) may provide some insight into which 
combination of the patient's measurements best describes the relative posi- 
tion of this individual in the multivariate reference populations. For example, 
if this patient's multivariate quantiles are becoming more "extreme" with 
time, the patient-specific optimal direction ao may help the physician under- 
stand in which way this patient is deviating from the bulk of the reference 
populations. 
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We can now use different graphical tools to display the relative position 
of the individual of interest with respect to the projected reference popula- 
tions. In the example below we use boxplots. Alternatively, one could use 
histograms or kernel density estimators. 

Example. We will illustrate the main ideas of our visualization pro- 
posal with a synthetic example. Although our proposal is aimed at situa- 
tions involving several variables, in what follows we only consider bivariate 
observations to be able to plot the multivariate data and their projections. 

We generated k = 4 reference samples of size 1000 from a bivariate nor- 
mal distribution with constant correlation equal to 0.77 and variances 1 and 
2.44. The mean vectors change over time from (5,5)' to (10.5,9)'. In Fig- 
ure 2 we display the four datasets along with the bivariate measurements 
corresponding to a "patient of interest," indicated with solid circles. The 
half-space depth bivariate quantiles for this patient are 0.79, 0.38, 0.07 and 
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Fig. 2. Reference samples of size 1000, arf times ti, t-2, tz andt^. The measurements for 
a "patient of interest" are indicated with solid circles. 
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Fig. 3. Relative position of the " patient of interest" (indicated with a solid circle) with 
respect to the reference samples projected on the patient-specific optimal direction. 



0.02 at times t±, t2, t% and £4, respectively. Clearly this patient is becoming 
atypical as time progresses. 

To find ao in (1.1) we used a grid of 500 directions with equally spaced 
angles between and tt. Note that the multivariate quantiles qt only need 
to be computed once, regardless of the number of directions used in the nu- 
merical optimization. The bivariate half-space depths were computed using 
the AS 307 algorithm [1]. The optimal ao = (0.72, —0.69)' and the univariate 
quantiles q~tj(&o) were found to be 0.79, 0.38, 0.09 and 0.02, in close agree- 
ment with the multivariate ones. It is interesting to note that the "source" of 
this patient's increasing "unusualness" seems to be explained by a contrast 
between the first and second measurements. If these were height and weight, 
for example, we could conclude that this patient is becoming disproportion- 
ally tall for his/her weight. 

Figure 3 shows the position of the "patient of interest" with respect to 
the reference samples projected on the patient-specific optimal direction ao 
The corresponding projections of the point of interest are indicated with a 
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(a) Reference sample at time ii 
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(c) Reference sample at time t3 
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Fig. 4. Relative position of the "patient of interest" (indicated with a solid circle) with 
respect to the two univariate coordinates of the reference samples. 



solid dot. We can clearly see that this point is noticeably deviating from the 
bulk of the population (crossing quantile boundaries) as time progresses. 
Furthermore, Figure 4 shows the relative position of the "patient of inter- 
est" using the two original variables. Note that, as it is to be expected, the 
univariate approach fails to detect the unusual behavior of this patient's 
measurements. This last figure illustrates the potential usefulness of mul- 
tivariate growth charts compared with the analysis of several univariate 
growth charts. 
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